UWN: A Large Multilingual Lexical Knowledge Base
نویسندگان
چکیده
We present UWN, a large multilingual lexical knowledge base that describes the meanings and relationships of words in over 200 languages. This paper explains how link prediction, information integration and taxonomy induction methods have been used to build UWN based on WordNet and extend it with millions of named entities from Wikipedia. We additionally introduce extensions to cover lexical relationships, frame-semantic knowledge, and language data. An online interface provides human access to the data, while a software API enables applications to look up over 16 million words and names.
منابع مشابه
Towards Universal Multilingual Knowledge Bases
Lexical, ontological, as well as encyclopedic knowledge is increasingly being encoded in machine-readable form. This paper deals with knowledge representation in multilingual settings. It begins by proposing a generic graph-based knowledge base framework, and then, in three case studies, explains how preexisting knowledge can be cast into this framework. The first case study involves enriching ...
متن کاملJoining Forces Pays Off: Multilingual Joint Word Sense Disambiguation
We present a multilingual joint approach to Word Sense Disambiguation (WSD). Our method exploits BabelNet, a very large multilingual knowledge base, to perform graphbased WSD across different languages, and brings together empirical evidence from these languages using ensemble methods. The results show that, thanks to complementing wide-coverage multilingual lexical knowledge with robust graph-...
متن کاملLexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
The last two decades have seen the development of various semantic lexical resources such as WordNet (Miller, 1995) and the USAS semantic lexicon (Rayson et al., 2004), which have played an important role in the areas of natural language processing and corpus-based studies. Recently, increasing efforts have been devoted to extending the semantic frameworks of existing lexical knowledge resource...
متن کاملMultilingual Lexical Representation
The approach to multilingual lexical representation developed as part of the ACQUILEX Lexical Knowledge Base (LKB) discussed with specific reference to complex translation equivalence. The treatment described provides a lexicalist account of translation mismatches in terms of translation links which capture cross-linguistic generalizations across sets of semantically related lexical items, and ...
متن کاملMultilingual WSD with Just a Few Lines of Code: the BabelNet API
In this paper we present an API for programmatic access to BabelNet – a wide-coverage multilingual lexical knowledge base – and multilingual knowledge-rich Word Sense Disambiguation (WSD). Our aim is to provide the research community with easy-to-use tools to perform multilingual lexical semantic analysis and foster further research in this direction.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012